Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 193 |
| Missing cells | 1071 |
| Missing cells (%) | 21.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 40.7 KiB |
| Average record size in memory | 216.0 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 24 |
code has a high cardinality: 193 distinct values | High cardinality |
country has a high cardinality: 193 distinct values | High cardinality |
1st_const_year is highly correlated with 2nd_const_year and 7 other fields | High correlation |
2nd_const_year is highly correlated with 1st_const_year and 6 other fields | High correlation |
1st_2nd_tfidf is highly correlated with 1st_current_tfidf and 2 other fields | High correlation |
1st_current_tfidf is highly correlated with 1st_2nd_tfidf and 7 other fields | High correlation |
1st_2nd_lda is highly correlated with 1st_2nd_tfidf and 2 other fields | High correlation |
1st_current_lda is highly correlated with 1st_current_tfidf and 3 other fields | High correlation |
1st_2nd_use is highly correlated with 1st_2nd_lda and 4 other fields | High correlation |
1st_current_use is highly correlated with 1st_2nd_use and 2 other fields | High correlation |
1st_2nd_stm is highly correlated with 1st_2nd_lda and 1 other fields | High correlation |
1st_current_stm is highly correlated with 1st_current_tfidf and 4 other fields | High correlation |
constitutional_time is highly correlated with 1st_const_year and 7 other fields | High correlation |
first_regime_time is highly correlated with 1st_2nd_tfidf_adj and 3 other fields | High correlation |
1st_2nd_tfidf_adj is highly correlated with first_regime_time and 4 other fields | High correlation |
1st_2nd_lda_adj is highly correlated with first_regime_time and 3 other fields | High correlation |
1st_2nd_use_adj is highly correlated with 1st_2nd_use and 5 other fields | High correlation |
1st_2nd_stm_adj is highly correlated with first_regime_time and 3 other fields | High correlation |
1st_curr_tfidf_adj is highly correlated with 1st_current_tfidf and 3 other fields | High correlation |
1st_curr_lda_adj is highly correlated with 1st_const_year and 5 other fields | High correlation |
1st_curr_use_adj is highly correlated with 1st_current_use and 3 other fields | High correlation |
1st_curr_stm_adj is highly correlated with 1st_const_year and 5 other fields | High correlation |
tfidf_distance is highly correlated with 1st_const_year and 9 other fields | High correlation |
lda_distance is highly correlated with 1st_const_year and 8 other fields | High correlation |
use_distance is highly correlated with 1st_const_year and 8 other fields | High correlation |
stm_distance is highly correlated with 1st_const_year and 8 other fields | High correlation |
1st_const_year is highly correlated with 2nd_const_year and 6 other fields | High correlation |
2nd_const_year is highly correlated with 1st_const_year and 6 other fields | High correlation |
1st_2nd_tfidf is highly correlated with 1st_current_tfidf and 1 other fields | High correlation |
1st_current_tfidf is highly correlated with 1st_2nd_tfidf and 3 other fields | High correlation |
1st_2nd_lda is highly correlated with 1st_2nd_tfidf and 3 other fields | High correlation |
1st_current_lda is highly correlated with 1st_current_tfidf and 3 other fields | High correlation |
1st_2nd_use is highly correlated with 1st_2nd_lda and 2 other fields | High correlation |
1st_current_use is highly correlated with 1st_2nd_use and 1 other fields | High correlation |
1st_2nd_stm is highly correlated with 1st_2nd_lda and 1 other fields | High correlation |
1st_current_stm is highly correlated with 1st_current_tfidf and 4 other fields | High correlation |
constitutional_time is highly correlated with 1st_const_year and 6 other fields | High correlation |
1st_2nd_tfidf_adj is highly correlated with 1st_2nd_lda_adj and 2 other fields | High correlation |
1st_2nd_lda_adj is highly correlated with 1st_2nd_tfidf_adj and 2 other fields | High correlation |
1st_2nd_use_adj is highly correlated with 1st_2nd_tfidf_adj and 2 other fields | High correlation |
1st_2nd_stm_adj is highly correlated with 1st_2nd_tfidf_adj and 2 other fields | High correlation |
1st_curr_tfidf_adj is highly correlated with 1st_current_tfidf and 2 other fields | High correlation |
1st_curr_lda_adj is highly correlated with 1st_const_year and 5 other fields | High correlation |
1st_curr_use_adj is highly correlated with 1st_2nd_use and 2 other fields | High correlation |
1st_curr_stm_adj is highly correlated with 1st_const_year and 4 other fields | High correlation |
tfidf_distance is highly correlated with 1st_const_year and 5 other fields | High correlation |
lda_distance is highly correlated with 1st_const_year and 7 other fields | High correlation |
use_distance is highly correlated with tfidf_distance and 2 other fields | High correlation |
stm_distance is highly correlated with 1st_const_year and 6 other fields | High correlation |
1st_const_year is highly correlated with 2nd_const_year and 2 other fields | High correlation |
2nd_const_year is highly correlated with 1st_const_year and 1 other fields | High correlation |
1st_2nd_tfidf is highly correlated with 1st_current_tfidf | High correlation |
1st_current_tfidf is highly correlated with 1st_2nd_tfidf | High correlation |
1st_2nd_lda is highly correlated with 1st_2nd_stm | High correlation |
1st_current_lda is highly correlated with 1st_current_stm | High correlation |
1st_current_use is highly correlated with 1st_curr_use_adj and 1 other fields | High correlation |
1st_2nd_stm is highly correlated with 1st_2nd_lda | High correlation |
1st_current_stm is highly correlated with 1st_current_lda and 1 other fields | High correlation |
constitutional_time is highly correlated with 1st_const_year and 2 other fields | High correlation |
first_regime_time is highly correlated with 1st_2nd_tfidf_adj and 2 other fields | High correlation |
1st_2nd_tfidf_adj is highly correlated with first_regime_time and 2 other fields | High correlation |
1st_2nd_lda_adj is highly correlated with first_regime_time and 3 other fields | High correlation |
1st_2nd_use_adj is highly correlated with 1st_2nd_lda_adj and 1 other fields | High correlation |
1st_2nd_stm_adj is highly correlated with first_regime_time and 3 other fields | High correlation |
1st_curr_tfidf_adj is highly correlated with 1st_curr_lda_adj | High correlation |
1st_curr_lda_adj is highly correlated with 1st_const_year and 3 other fields | High correlation |
1st_curr_use_adj is highly correlated with 1st_current_use | High correlation |
1st_curr_stm_adj is highly correlated with 1st_curr_lda_adj | High correlation |
tfidf_distance is highly correlated with lda_distance and 2 other fields | High correlation |
lda_distance is highly correlated with tfidf_distance and 2 other fields | High correlation |
use_distance is highly correlated with 1st_current_use and 3 other fields | High correlation |
stm_distance is highly correlated with 1st_current_stm and 3 other fields | High correlation |
1st_const_year is highly correlated with 2nd_const_year and 6 other fields | High correlation |
2nd_const_year is highly correlated with 1st_const_year and 7 other fields | High correlation |
1st_2nd_tfidf is highly correlated with 1st_current_tfidf and 1 other fields | High correlation |
1st_current_tfidf is highly correlated with 1st_2nd_tfidf and 3 other fields | High correlation |
1st_2nd_lda is highly correlated with 1st_2nd_tfidf and 4 other fields | High correlation |
1st_current_lda is highly correlated with 1st_current_tfidf and 3 other fields | High correlation |
1st_2nd_use is highly correlated with 1st_current_use and 4 other fields | High correlation |
1st_current_use is highly correlated with 1st_2nd_use and 3 other fields | High correlation |
1st_2nd_stm is highly correlated with 1st_2nd_lda and 2 other fields | High correlation |
1st_current_stm is highly correlated with 1st_current_tfidf and 4 other fields | High correlation |
constitutional_time is highly correlated with 1st_const_year and 6 other fields | High correlation |
first_regime_time is highly correlated with 1st_const_year and 1 other fields | High correlation |
1st_2nd_tfidf_adj is highly correlated with 2nd_const_year and 7 other fields | High correlation |
1st_2nd_lda_adj is highly correlated with 2nd_const_year and 6 other fields | High correlation |
1st_2nd_use_adj is highly correlated with 1st_2nd_use and 7 other fields | High correlation |
1st_2nd_stm_adj is highly correlated with 1st_2nd_tfidf_adj and 3 other fields | High correlation |
1st_curr_tfidf_adj is highly correlated with 1st_const_year and 5 other fields | High correlation |
1st_curr_lda_adj is highly correlated with 1st_const_year and 4 other fields | High correlation |
1st_curr_use_adj is highly correlated with 1st_2nd_use and 4 other fields | High correlation |
1st_curr_stm_adj is highly correlated with 1st_const_year and 5 other fields | High correlation |
tfidf_distance is highly correlated with 2nd_const_year and 4 other fields | High correlation |
lda_distance is highly correlated with 1st_const_year and 10 other fields | High correlation |
use_distance is highly correlated with 2nd_const_year and 6 other fields | High correlation |
stm_distance is highly correlated with 2nd_const_year and 3 other fields | High correlation |
2nd_const_year has 63 (32.6%) missing values | Missing |
1st_2nd_tfidf has 63 (32.6%) missing values | Missing |
1st_current_tfidf has 63 (32.6%) missing values | Missing |
1st_2nd_lda has 63 (32.6%) missing values | Missing |
1st_current_lda has 63 (32.6%) missing values | Missing |
1st_2nd_use has 63 (32.6%) missing values | Missing |
1st_current_use has 63 (32.6%) missing values | Missing |
1st_2nd_stm has 63 (32.6%) missing values | Missing |
1st_current_stm has 63 (32.6%) missing values | Missing |
1st_2nd_tfidf_adj has 63 (32.6%) missing values | Missing |
1st_2nd_lda_adj has 63 (32.6%) missing values | Missing |
1st_2nd_use_adj has 63 (32.6%) missing values | Missing |
1st_2nd_stm_adj has 63 (32.6%) missing values | Missing |
1st_curr_tfidf_adj has 63 (32.6%) missing values | Missing |
1st_curr_lda_adj has 63 (32.6%) missing values | Missing |
1st_curr_use_adj has 63 (32.6%) missing values | Missing |
1st_curr_stm_adj has 63 (32.6%) missing values | Missing |
code is uniformly distributed | Uniform |
country is uniformly distributed | Uniform |
code has unique values | Unique |
country has unique values | Unique |
tfidf_distance has 63 (32.6%) zeros | Zeros |
lda_distance has 63 (32.6%) zeros | Zeros |
use_distance has 63 (32.6%) zeros | Zeros |
stm_distance has 63 (32.6%) zeros | Zeros |
Reproduction
| Analysis started | 2022-06-15 18:46:34.245667 |
|---|---|
| Analysis finished | 2022-06-15 18:55:55.824472 |
| Duration | 9 minutes and 21.58 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 193 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 KiB |
| AFG | 1 |
|---|---|
| LIE | 1 |
| NZL | 1 |
| NIC | 1 |
| NER | 1 |
| Other values (188) |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.005181347 |
| Min length | 3 |
Characters and Unicode
| Total characters | 580 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 193 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AFG |
|---|---|
| 2nd row | ALB |
| 3rd row | DZA |
| 4th row | AND |
| 5th row | AGO |
Common Values
| Value | Count | Frequency (%) |
| AFG | 1 | 0.5% |
| LIE | 1 | 0.5% |
| NZL | 1 | 0.5% |
| NIC | 1 | 0.5% |
| NER | 1 | 0.5% |
| NGA | 1 | 0.5% |
| PRK | 1 | 0.5% |
| NOR | 1 | 0.5% |
| OMN | 1 | 0.5% |
| PAK | 1 | 0.5% |
| Other values (183) | 183 |
Length
| Value | Count | Frequency (%) |
| afg | 1 | 0.5% |
| alb | 1 | 0.5% |
| bhs | 1 | 0.5% |
| dza | 1 | 0.5% |
| and | 1 | 0.5% |
| ago | 1 | 0.5% |
| atg | 1 | 0.5% |
| arg | 1 | 0.5% |
| arm | 1 | 0.5% |
| aus | 1 | 0.5% |
| Other values (183) | 183 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 580 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 580 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 580 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 44 | 7.6% |
| R | 43 | 7.4% |
| N | 40 | 6.9% |
| M | 36 | 6.2% |
| L | 34 | 5.9% |
| S | 33 | 5.7% |
| T | 32 | 5.5% |
| G | 29 | 5.0% |
| B | 28 | 4.8% |
| E | 26 | 4.5% |
| Other values (16) | 235 |
| Distinct | 193 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 KiB |
| Afghanistan | 1 |
|---|---|
| Liechtenstein | 1 |
| New zealand | 1 |
| Nicaragua | 1 |
| Niger | 1 |
| Other values (188) |
Length
| Max length | 32 |
|---|---|
| Median length | 21 |
| Mean length | 8.455958549 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1632 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 193 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Afghanistan |
|---|---|
| 2nd row | Albania |
| 3rd row | Algeria |
| 4th row | Andorra |
| 5th row | Angola |
Common Values
| Value | Count | Frequency (%) |
| Afghanistan | 1 | 0.5% |
| Liechtenstein | 1 | 0.5% |
| New zealand | 1 | 0.5% |
| Nicaragua | 1 | 0.5% |
| Niger | 1 | 0.5% |
| Nigeria | 1 | 0.5% |
| North Korea | 1 | 0.5% |
| Norway | 1 | 0.5% |
| Oman | 1 | 0.5% |
| Pakistan | 1 | 0.5% |
| Other values (183) | 183 |
Length
| Value | Count | Frequency (%) |
| republic | 5 | 2.1% |
| and | 4 | 1.7% |
| guinea | 4 | 1.7% |
| saint | 3 | 1.2% |
| south | 3 | 1.2% |
| the | 2 | 0.8% |
| korea | 2 | 0.8% |
| united | 2 | 0.8% |
| sudan | 2 | 0.8% |
| arab | 2 | 0.8% |
| Other values (207) | 211 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 8.8% |
| n | 125 | 7.7% |
| e | 109 | 6.7% |
| o | 90 | 5.5% |
| r | 89 | 5.5% |
| u | 66 | 4.0% |
| t | 62 | 3.8% |
| l | 60 | 3.7% |
| s | 52 | 3.2% |
| Other values (41) | 580 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1352 | |
| Uppercase Letter | 233 | 14.3% |
| Space Separator | 47 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | |
| n | 125 | |
| e | 109 | 8.1% |
| o | 90 | 6.7% |
| r | 89 | 6.6% |
| u | 66 | 4.9% |
| t | 62 | 4.6% |
| l | 60 | 4.4% |
| s | 52 | 3.8% |
| Other values (16) | 300 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 27 | 11.6% |
| M | 19 | 8.2% |
| B | 18 | 7.7% |
| C | 18 | 7.7% |
| A | 17 | 7.3% |
| T | 15 | 6.4% |
| G | 15 | 6.4% |
| N | 12 | 5.2% |
| L | 12 | 5.2% |
| I | 10 | 4.3% |
| Other values (14) | 70 |
Space Separator
| Value | Count | Frequency (%) |
| 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1585 | |
| Common | 47 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 9.1% |
| n | 125 | 7.9% |
| e | 109 | 6.9% |
| o | 90 | 5.7% |
| r | 89 | 5.6% |
| u | 66 | 4.2% |
| t | 62 | 3.9% |
| l | 60 | 3.8% |
| s | 52 | 3.3% |
| Other values (40) | 533 |
Common
| Value | Count | Frequency (%) |
| 47 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1632 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 255 | |
| i | 144 | 8.8% |
| n | 125 | 7.7% |
| e | 109 | 6.7% |
| o | 90 | 5.5% |
| r | 89 | 5.5% |
| u | 66 | 4.0% |
| t | 62 | 3.8% |
| l | 60 | 3.7% |
| s | 52 | 3.2% |
| Other values (41) | 580 |
1st_const_year
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 99 |
|---|---|
| Distinct (%) | 51.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1935.792746 |
| Minimum | 1789 |
|---|---|
| Maximum | 2011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 1789 |
|---|---|
| 5-th percentile | 1816.4 |
| Q1 | 1919 |
| median | 1960 |
| Q3 | 1975 |
| 95-th percentile | 1995 |
| Maximum | 2011 |
| Range | 222 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 57.59212321 |
|---|---|
| Coefficient of variation (CV) | 0.02975118247 |
| Kurtosis | 0.1093821862 |
| Mean | 1935.792746 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | -1.139208249 |
| Sum | 373608 |
| Variance | 3316.852655 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1960 | 9 | 4.7% |
| 1962 | 8 | 4.1% |
| 1978 | 6 | 3.1% |
| 1947 | 5 | 2.6% |
| 1991 | 5 | 2.6% |
| 1979 | 5 | 2.6% |
| 1964 | 5 | 2.6% |
| 1981 | 5 | 2.6% |
| 1975 | 5 | 2.6% |
| 1961 | 5 | 2.6% |
| Other values (89) | 135 |
| Value | Count | Frequency (%) |
| 1789 | 1 | |
| 1791 | 2 | |
| 1795 | 1 | |
| 1801 | 1 | |
| 1808 | 1 | |
| 1809 | 1 | |
| 1811 | 1 | |
| 1813 | 1 | |
| 1814 | 1 | |
| 1818 | 1 |
| Value | Count | Frequency (%) |
| 2011 | 1 | 0.5% |
| 2008 | 1 | 0.5% |
| 2003 | 1 | 0.5% |
| 2002 | 1 | 0.5% |
| 1998 | 1 | 0.5% |
| 1997 | 1 | 0.5% |
| 1996 | 2 | |
| 1995 | 4 | |
| 1994 | 2 | |
| 1993 | 3 |
2nd_const_year
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 83 |
|---|---|
| Distinct (%) | 63.8% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1950.130769 |
| Minimum | 1793 |
|---|---|
| Maximum | 2011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 1793 |
|---|---|
| 5-th percentile | 1828.9 |
| Q1 | 1931 |
| median | 1969.5 |
| Q3 | 1989.75 |
| 95-th percentile | 2007.55 |
| Maximum | 2011 |
| Range | 218 |
| Interquartile range (IQR) | 58.75 |
Descriptive statistics
| Standard deviation | 53.9311057 |
|---|---|
| Coefficient of variation (CV) | 0.0276551227 |
| Kurtosis | 0.7794133577 |
| Mean | 1950.130769 |
| Median Absolute Deviation (MAD) | 24.5 |
| Skewness | -1.284927154 |
| Sum | 253517 |
| Variance | 2908.564162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1979 | 4 | 2.1% |
| 1992 | 4 | 2.1% |
| 1962 | 4 | 2.1% |
| 1974 | 3 | 1.6% |
| 1996 | 3 | 1.6% |
| 2005 | 3 | 1.6% |
| 1970 | 3 | 1.6% |
| 2008 | 3 | 1.6% |
| 1978 | 3 | 1.6% |
| 1959 | 2 | 1.0% |
| Other values (73) | 98 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 1793 | 1 | |
| 1805 | 1 | |
| 1812 | 1 | |
| 1815 | 1 | |
| 1823 | 1 | |
| 1826 | 1 | |
| 1828 | 1 | |
| 1830 | 1 | |
| 1831 | 1 | |
| 1836 | 1 |
| Value | Count | Frequency (%) |
| 2011 | 2 | |
| 2010 | 2 | |
| 2008 | 3 | |
| 2007 | 1 | 0.5% |
| 2005 | 3 | |
| 2002 | 2 | |
| 2001 | 2 | |
| 1999 | 1 | 0.5% |
| 1998 | 1 | 0.5% |
| 1996 | 3 |
1st_2nd_tfidf
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5546450492 |
| Minimum | 0.0201733 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0201733 |
|---|---|
| 5-th percentile | 0.067218377 |
| Q1 | 0.3511306875 |
| median | 0.57103248 |
| Q3 | 0.826286805 |
| 95-th percentile | 0.961606845 |
| Maximum | 1 |
| Range | 0.9798267 |
| Interquartile range (IQR) | 0.4751561175 |
Descriptive statistics
| Standard deviation | 0.2897924935 |
|---|---|
| Coefficient of variation (CV) | 0.5224827913 |
| Kurtosis | -1.039943357 |
| Mean | 0.5546450492 |
| Median Absolute Deviation (MAD) | 0.241428725 |
| Skewness | -0.2278169548 |
| Sum | 72.1038564 |
| Variance | 0.08397968929 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3536304 | 1 | 0.5% |
| 0.39374864 | 1 | 0.5% |
| 0.98284872 | 1 | 0.5% |
| 0.57017142 | 1 | 0.5% |
| 0.89508546 | 1 | 0.5% |
| 0.0300009 | 1 | 0.5% |
| 0.77503286 | 1 | 0.5% |
| 0.14577967 | 1 | 0.5% |
| 0.80957755 | 1 | 0.5% |
| 0.6414186 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0201733 | 1 | |
| 0.0300009 | 1 | |
| 0.03110075 | 1 | |
| 0.04367113 | 1 | |
| 0.0565919 | 1 | |
| 0.0569241 | 1 | |
| 0.06451124 | 1 | |
| 0.0705271 | 1 | |
| 0.0705558 | 1 | |
| 0.08149785 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9976256217 | 1 | |
| 0.98284872 | 1 | |
| 0.976940108 | 1 | |
| 0.974659728 | 1 | |
| 0.968144733 | 1 | |
| 0.962688424 | 1 | |
| 0.960284915 | 1 | |
| 0.95103436 | 1 | |
| 0.94884168 | 1 |
1st_current_tfidf
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6039565952 |
| Minimum | 0.0201733 |
|---|---|
| Maximum | 1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0201733 |
|---|---|
| 5-th percentile | 0.0761215455 |
| Q1 | 0.372131245 |
| median | 0.64427824 |
| Q3 | 0.8753182905 |
| 95-th percentile | 0.9830993891 |
| Maximum | 1 |
| Range | 0.9798267 |
| Interquartile range (IQR) | 0.5031870455 |
Descriptive statistics
| Standard deviation | 0.2953947897 |
|---|---|
| Coefficient of variation (CV) | 0.4890993691 |
| Kurtosis | -1.025800126 |
| Mean | 0.6039565952 |
| Median Absolute Deviation (MAD) | 0.247694305 |
| Skewness | -0.4713303774 |
| Sum | 78.51435738 |
| Variance | 0.08725808177 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3035168 | 1 | 0.5% |
| 0.4996174 | 1 | 0.5% |
| 0.98284872 | 1 | 0.5% |
| 0.74367234 | 1 | 0.5% |
| 0.73904103 | 1 | 0.5% |
| 0.2395039 | 1 | 0.5% |
| 0.4761298 | 1 | 0.5% |
| 0.1445826 | 1 | 0.5% |
| 0.921665385 | 1 | 0.5% |
| 0.983304482 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0201733 | 1 | |
| 0.03110075 | 1 | |
| 0.0565919 | 1 | |
| 0.0569241 | 1 | |
| 0.0605006 | 1 | |
| 0.06490505 | 1 | |
| 0.07049835 | 1 | |
| 0.08299434 | 1 | |
| 0.08384097 | 1 | |
| 0.08689183 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9976256217 | 1 | |
| 0.9936541156 | 1 | |
| 0.990213237 | 1 | |
| 0.98426032 | 1 | |
| 0.98410011 | 1 | |
| 0.983304482 | 1 | |
| 0.98284872 | 1 | |
| 0.976940108 | 1 | |
| 0.96477796 | 1 |
1st_2nd_lda
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.443165611 |
| Minimum | 0.0009248798472 |
|---|---|
| Maximum | 0.8324159977 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0009248798472 |
|---|---|
| 5-th percentile | 0.1120276952 |
| Q1 | 0.2944028361 |
| median | 0.4365144665 |
| Q3 | 0.5777939292 |
| 95-th percentile | 0.8067003927 |
| Maximum | 0.8324159977 |
| Range | 0.8314911179 |
| Interquartile range (IQR) | 0.283391093 |
Descriptive statistics
| Standard deviation | 0.2031193633 |
|---|---|
| Coefficient of variation (CV) | 0.4583373761 |
| Kurtosis | -0.6423280871 |
| Mean | 0.443165611 |
| Median Absolute Deviation (MAD) | 0.1429888014 |
| Skewness | 0.04752162724 |
| Sum | 57.61152943 |
| Variance | 0.04125747575 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.4902039477 | 1 | 0.5% |
| 0.4247056517 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 0.1832512237 | 1 | 0.5% |
| 0.5699860148 | 1 | 0.5% |
| 0.05153921136 | 1 | 0.5% |
| 0.1837521032 | 1 | 0.5% |
| 0.1090214883 | 1 | 0.5% |
| 0.6453213329 | 1 | 0.5% |
| 0.6659218507 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0009248798472 | 1 | |
| 0.009009424209 | 1 | |
| 0.02491735492 | 1 | |
| 0.05153921136 | 1 | |
| 0.106568484 | 1 | |
| 0.1090214883 | 1 | |
| 0.1107000823 | 1 | |
| 0.1136503332 | 1 | |
| 0.1224426352 | 1 | |
| 0.1570453903 | 1 |
| Value | Count | Frequency (%) |
| 0.8324159977 | 1 | |
| 0.8322413363 | 1 | |
| 0.831322489 | 1 | |
| 0.8247713524 | 1 | |
| 0.8150825904 | 1 | |
| 0.8112244864 | 1 | |
| 0.810847399 | 1 | |
| 0.8016318293 | 1 | |
| 0.777378254 | 1 | |
| 0.7398105908 | 1 |
1st_current_lda
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5835165935 |
| Minimum | 0.009009424209 |
|---|---|
| Maximum | 0.8324257359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.009009424209 |
|---|---|
| 5-th percentile | 0.2251729199 |
| Q1 | 0.4804396153 |
| median | 0.6170690393 |
| Q3 | 0.731420406 |
| 95-th percentile | 0.8179054265 |
| Maximum | 0.8324257359 |
| Range | 0.8234163117 |
| Interquartile range (IQR) | 0.2509807907 |
Descriptive statistics
| Standard deviation | 0.1909343338 |
|---|---|
| Coefficient of variation (CV) | 0.3272132035 |
| Kurtosis | 0.277182096 |
| Mean | 0.5835165935 |
| Median Absolute Deviation (MAD) | 0.119330462 |
| Skewness | -0.8797226781 |
| Sum | 75.85715715 |
| Variance | 0.03645591984 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6341485178 | 1 | 0.5% |
| 0.5404719296 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 0.6720473046 | 1 | 0.5% |
| 0.7431631803 | 1 | 0.5% |
| 0.4167833364 | 1 | 0.5% |
| 0.2256352177 | 1 | 0.5% |
| 0.04773825738 | 1 | 0.5% |
| 0.8312122451 | 1 | 0.5% |
| 0.8022149626 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.009009424209 | 1 | |
| 0.04773825738 | 1 | |
| 0.106568484 | 1 | |
| 0.1136503332 | 1 | |
| 0.1423339059 | 1 | |
| 0.1482686852 | 1 | |
| 0.2247946763 | 1 | |
| 0.2256352177 | 1 | |
| 0.2632287717 | 1 | |
| 0.2703033666 | 1 |
| Value | Count | Frequency (%) |
| 0.8324257359 | 1 | |
| 0.8324159977 | 1 | |
| 0.8321897687 | 1 | |
| 0.8312122451 | 1 | |
| 0.8246794945 | 1 | |
| 0.8213321706 | 1 | |
| 0.8202150196 | 1 | |
| 0.8150825904 | 1 | |
| 0.8131771201 | 1 | |
| 0.8112244864 | 1 |
| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1056574339 |
| Minimum | 0.005343358713 |
|---|---|
| Maximum | 0.4208145276 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.005343358713 |
|---|---|
| 5-th percentile | 0.01089700337 |
| Q1 | 0.03987112141 |
| median | 0.06771642546 |
| Q3 | 0.157467436 |
| 95-th percentile | 0.3063234332 |
| Maximum | 0.4208145276 |
| Range | 0.4154711689 |
| Interquartile range (IQR) | 0.1175963146 |
Descriptive statistics
| Standard deviation | 0.09324338942 |
|---|---|
| Coefficient of variation (CV) | 0.8825066631 |
| Kurtosis | 1.081367524 |
| Mean | 0.1056574339 |
| Median Absolute Deviation (MAD) | 0.03885217586 |
| Skewness | 1.322426056 |
| Sum | 13.73546641 |
| Variance | 0.008694329671 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.05512329385 | 1 | 0.5% |
| 0.05099676958 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.01088671013 | 1 | 0.5% |
| 0.04217074802 | 1 | 0.5% |
| 0.00715785513 | 1 | 0.5% |
| 0.05273526816 | 1 | 0.5% |
| 0.01315690476 | 1 | 0.5% |
| 0.2756207437 | 1 | 0.5% |
| 0.3081018993 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.005343358713 | 1 | |
| 0.006509150755 | 1 | |
| 0.00693760752 | 1 | |
| 0.007083448061 | 1 | |
| 0.00715785513 | 1 | |
| 0.01004561837 | 1 | |
| 0.01088671013 | 1 | |
| 0.010909584 | 1 | |
| 0.01315690476 | 1 | |
| 0.01458493451 | 1 |
| Value | Count | Frequency (%) |
| 0.4208145276 | 1 | |
| 0.3867030993 | 1 | |
| 0.3642154469 | 1 | |
| 0.3285916033 | 1 | |
| 0.3113730362 | 1 | |
| 0.3081018993 | 1 | |
| 0.3070696313 | 1 | |
| 0.3054114134 | 1 | |
| 0.3009266105 | 1 | |
| 0.3005765842 | 1 |
1st_current_use
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1435744226 |
| Minimum | 0.004723398218 |
|---|---|
| Maximum | 0.5232621873 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.004723398218 |
|---|---|
| 5-th percentile | 0.02655054297 |
| Q1 | 0.07085178409 |
| median | 0.1205537431 |
| Q3 | 0.1908757909 |
| 95-th percentile | 0.3132243149 |
| Maximum | 0.5232621873 |
| Range | 0.5185387891 |
| Interquartile range (IQR) | 0.1200240068 |
Descriptive statistics
| Standard deviation | 0.09983241559 |
|---|---|
| Coefficient of variation (CV) | 0.6953356578 |
| Kurtosis | 1.864928993 |
| Mean | 0.1435744226 |
| Median Absolute Deviation (MAD) | 0.05930442002 |
| Skewness | 1.24056138 |
| Sum | 18.66467494 |
| Variance | 0.009966511202 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1669640441 | 1 | 0.5% |
| 0.08687377779 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.08684611184 | 1 | 0.5% |
| 0.1512387381 | 1 | 0.5% |
| 0.07166519638 | 1 | 0.5% |
| 0.09127517087 | 1 | 0.5% |
| 0.00896903061 | 1 | 0.5% |
| 0.5029965022 | 1 | 0.5% |
| 0.1078774117 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.004723398218 | 1 | |
| 0.005343358713 | 1 | |
| 0.00896903061 | 1 | |
| 0.009546910405 | 1 | |
| 0.010909584 | 1 | |
| 0.01349929613 | 1 | |
| 0.02200860971 | 1 | |
| 0.03210179473 | 1 | |
| 0.03330747769 | 1 | |
| 0.03364362066 | 1 |
| Value | Count | Frequency (%) |
| 0.5232621873 | 1 | |
| 0.5029965022 | 1 | |
| 0.4208145276 | 1 | |
| 0.3642154469 | 1 | |
| 0.3380186079 | 1 | |
| 0.3285916033 | 1 | |
| 0.3143392196 | 1 | |
| 0.3118616537 | 1 | |
| 0.3070696313 | 1 | |
| 0.3057088118 | 1 |
1st_2nd_stm
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4255534369 |
| Minimum | 0.05346783526 |
|---|---|
| Maximum | 0.8310721109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.05346783526 |
|---|---|
| 5-th percentile | 0.1345424176 |
| Q1 | 0.2675765435 |
| median | 0.3944332318 |
| Q3 | 0.57040201 |
| 95-th percentile | 0.7747892121 |
| Maximum | 0.8310721109 |
| Range | 0.7776042756 |
| Interquartile range (IQR) | 0.3028254664 |
Descriptive statistics
| Standard deviation | 0.1973448023 |
|---|---|
| Coefficient of variation (CV) | 0.4637368312 |
| Kurtosis | -0.7112521818 |
| Mean | 0.4255534369 |
| Median Absolute Deviation (MAD) | 0.1363291275 |
| Skewness | 0.3402544825 |
| Sum | 55.3219468 |
| Variance | 0.03894497101 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.1808094633 | 1 | 0.5% |
| 0.3401846372 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 0.2597674839 | 1 | 0.5% |
| 0.1086374728 | 1 | 0.5% |
| 0.2751306334 | 1 | 0.5% |
| 0.1426708258 | 1 | 0.5% |
| 0.2371090592 | 1 | 0.5% |
| 0.496686597 | 1 | 0.5% |
| 0.7635874396 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.05346783526 | 1 | |
| 0.06301731907 | 1 | |
| 0.06664675472 | 1 | |
| 0.09193017521 | 1 | |
| 0.103935994 | 1 | |
| 0.1086374728 | 1 | |
| 0.1278919017 | 1 | |
| 0.1426708258 | 1 | |
| 0.1674803379 | 1 | |
| 0.1795426932 | 1 |
| Value | Count | Frequency (%) |
| 0.8310721109 | 1 | |
| 0.8302623488 | 1 | |
| 0.8220312078 | 1 | |
| 0.8139319874 | 1 | |
| 0.8129855665 | 1 | |
| 0.7927069066 | 1 | |
| 0.7750710625 | 1 | |
| 0.7744447283 | 1 | |
| 0.7646634819 | 1 | |
| 0.7636754462 | 1 |
1st_current_stm
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5528050629 |
| Minimum | 0.06664675472 |
|---|---|
| Maximum | 0.8312656847 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.06664675472 |
|---|---|
| 5-th percentile | 0.207770493 |
| Q1 | 0.3816559025 |
| median | 0.5801652868 |
| Q3 | 0.7475594909 |
| 95-th percentile | 0.8257300116 |
| Maximum | 0.8312656847 |
| Range | 0.7646189299 |
| Interquartile range (IQR) | 0.3659035884 |
Descriptive statistics
| Standard deviation | 0.2059741367 |
|---|---|
| Coefficient of variation (CV) | 0.3725981371 |
| Kurtosis | -1.016112254 |
| Mean | 0.5528050629 |
| Median Absolute Deviation (MAD) | 0.188419597 |
| Skewness | -0.3125395666 |
| Sum | 71.86465818 |
| Variance | 0.04242534497 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3514579609 | 1 | 0.5% |
| 0.6599127299 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 0.7922989702 | 1 | 0.5% |
| 0.8312656847 | 1 | 0.5% |
| 0.5429023615 | 1 | 0.5% |
| 0.2147874609 | 1 | 0.5% |
| 0.1978936538 | 1 | 0.5% |
| 0.7989632322 | 1 | 0.5% |
| 0.8304725389 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.06664675472 | 1 | |
| 0.08709304174 | 1 | |
| 0.103935994 | 1 | |
| 0.1905146018 | 1 | |
| 0.1978936538 | 1 | |
| 0.2014834105 | 1 | |
| 0.2020293374 | 1 | |
| 0.2147874609 | 1 | |
| 0.2274341764 | 1 | |
| 0.2287969862 | 1 |
| Value | Count | Frequency (%) |
| 0.8312656847 | 1 | |
| 0.8310721109 | 1 | |
| 0.8304725389 | 1 | |
| 0.8302623488 | 1 | |
| 0.8294142647 | 1 | |
| 0.8263893582 | 1 | |
| 0.8261795564 | 1 | |
| 0.8251805678 | 1 | |
| 0.8231224917 | 1 | |
| 0.8229771521 | 1 |
constitutional_time
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 99 |
|---|---|
| Distinct (%) | 51.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.20725389 |
| Minimum | 11 |
|---|---|
| Maximum | 233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 47 |
| median | 62 |
| Q3 | 103 |
| 95-th percentile | 205.6 |
| Maximum | 233 |
| Range | 222 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 57.59212321 |
|---|---|
| Coefficient of variation (CV) | 0.66806586 |
| Kurtosis | 0.1093821862 |
| Mean | 86.20725389 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 1.139208249 |
| Sum | 16638 |
| Variance | 3316.852655 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 9 | 4.7% |
| 60 | 8 | 4.1% |
| 44 | 6 | 3.1% |
| 75 | 5 | 2.6% |
| 31 | 5 | 2.6% |
| 43 | 5 | 2.6% |
| 58 | 5 | 2.6% |
| 41 | 5 | 2.6% |
| 47 | 5 | 2.6% |
| 61 | 5 | 2.6% |
| Other values (89) | 135 |
| Value | Count | Frequency (%) |
| 11 | 1 | 0.5% |
| 14 | 1 | 0.5% |
| 19 | 1 | 0.5% |
| 20 | 1 | 0.5% |
| 24 | 1 | 0.5% |
| 25 | 1 | 0.5% |
| 26 | 2 | |
| 27 | 4 | |
| 28 | 2 | |
| 29 | 3 |
| Value | Count | Frequency (%) |
| 233 | 1 | |
| 231 | 2 | |
| 227 | 1 | |
| 221 | 1 | |
| 214 | 1 | |
| 213 | 1 | |
| 211 | 1 | |
| 209 | 1 | |
| 208 | 1 | |
| 204 | 1 |
| Distinct | 80 |
|---|---|
| Distinct (%) | 41.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.79792746 |
| Minimum | 1 |
|---|---|
| Maximum | 233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 14 |
| median | 27 |
| Q3 | 48 |
| 95-th percentile | 110.2 |
| Maximum | 233 |
| Range | 232 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 39.51879294 |
|---|---|
| Coefficient of variation (CV) | 1.045528038 |
| Kurtosis | 7.385515371 |
| Mean | 37.79792746 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 2.469340765 |
| Sum | 7295 |
| Variance | 1561.734996 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 7 | 3.6% |
| 14 | 7 | 3.6% |
| 16 | 6 | 3.1% |
| 4 | 6 | 3.1% |
| 3 | 5 | 2.6% |
| 19 | 5 | 2.6% |
| 31 | 5 | 2.6% |
| 5 | 5 | 2.6% |
| 15 | 5 | 2.6% |
| 28 | 5 | 2.6% |
| Other values (70) | 137 |
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 2 | 3 | |
| 3 | 5 | |
| 4 | 6 | |
| 5 | 5 | |
| 6 | 4 | |
| 7 | 1 | 0.5% |
| 8 | 3 | |
| 9 | 3 | |
| 10 | 3 |
| Value | Count | Frequency (%) |
| 233 | 1 | |
| 208 | 1 | |
| 201 | 1 | |
| 191 | 1 | |
| 170 | 1 | |
| 165 | 1 | |
| 154 | 1 | |
| 147 | 1 | |
| 139 | 1 | |
| 121 | 1 |
1st_2nd_tfidf_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06468672845 |
| Minimum | 0.000502996 |
|---|---|
| Maximum | 0.907096975 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.000502996 |
|---|---|
| 5-th percentile | 0.004316380572 |
| Q1 | 0.01068151816 |
| median | 0.02548487012 |
| Q3 | 0.0555340769 |
| 95-th percentile | 0.2378890005 |
| Maximum | 0.907096975 |
| Range | 0.906593979 |
| Interquartile range (IQR) | 0.04485255874 |
Descriptive statistics
| Standard deviation | 0.131293906 |
|---|---|
| Coefficient of variation (CV) | 2.029688456 |
| Kurtosis | 26.298727 |
| Mean | 0.06468672845 |
| Median Absolute Deviation (MAD) | 0.01663822712 |
| Skewness | 4.776623824 |
| Sum | 8.409274699 |
| Variance | 0.01723808975 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.0442038 | 1 | 0.5% |
| 0.01093746222 | 1 | 0.5% |
| 0.049142436 | 1 | 0.5% |
| 0.016290612 | 1 | 0.5% |
| 0.03086501586 | 1 | 0.5% |
| 0.0100003 | 1 | 0.5% |
| 0.03229303583 | 1 | 0.5% |
| 0.02429661167 | 1 | 0.5% |
| 0.02611540484 | 1 | 0.5% |
| 0.04276124 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.000502996 | 1 | |
| 0.0008017704545 | 1 | |
| 0.00201733 | 1 | |
| 0.002350504317 | 1 | |
| 0.003794420833 | 1 | |
| 0.004042278571 | 1 | |
| 0.00406967027 | 1 | |
| 0.004617915385 | 1 | |
| 0.00465801125 | 1 | |
| 0.0050721 | 1 |
| Value | Count | Frequency (%) |
| 0.907096975 | 1 | |
| 0.90364835 | 1 | |
| 0.55195415 | 1 | |
| 0.35231498 | 1 | |
| 0.322714911 | 1 | |
| 0.31628056 | 1 | |
| 0.2379957 | 1 | |
| 0.23775859 | 1 | |
| 0.227624476 | 1 | |
| 0.20651088 | 1 |
1st_2nd_lda_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04951300263 |
| Minimum | 3.557230182 × 10-5 |
|---|---|
| Maximum | 0.8016318293 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 3.557230182 × 10-5 |
|---|---|
| 5-th percentile | 0.006485400323 |
| Q1 | 0.01182460787 |
| median | 0.02055020617 |
| Q3 | 0.04148586937 |
| 95-th percentile | 0.1757328853 |
| Maximum | 0.8016318293 |
| Range | 0.801596257 |
| Interquartile range (IQR) | 0.02966126151 |
Descriptive statistics
| Standard deviation | 0.1071385071 |
|---|---|
| Coefficient of variation (CV) | 2.163845888 |
| Kurtosis | 35.14639059 |
| Mean | 0.04951300263 |
| Median Absolute Deviation (MAD) | 0.01088323906 |
| Skewness | 5.585072789 |
| Sum | 6.436690342 |
| Variance | 0.01147865971 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.06127549346 | 1 | 0.5% |
| 0.01179737921 | 1 | 0.5% |
| 0.03669055506 | 1 | 0.5% |
| 0.005235749249 | 1 | 0.5% |
| 0.01965469016 | 1 | 0.5% |
| 0.01717973712 | 1 | 0.5% |
| 0.007656337635 | 1 | 0.5% |
| 0.01817024805 | 1 | 0.5% |
| 0.02081681719 | 1 | 0.5% |
| 0.04439479005 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 3.557230182 × 10-5 | 1 | |
| 0.002620096689 | 1 | |
| 0.003180420336 | 1 | |
| 0.004033466684 | 1 | |
| 0.005235749249 | 1 | |
| 0.006174361731 | 1 | |
| 0.006477954831 | 1 | |
| 0.006494500368 | 1 | |
| 0.00649978057 | 1 | |
| 0.006762087726 | 1 |
| Value | Count | Frequency (%) |
| 0.8016318293 | 1 | |
| 0.777378254 | 1 | |
| 0.4077143686 | 1 | |
| 0.2328861703 | 1 | |
| 0.1969738114 | 1 | |
| 0.187437504 | 1 | |
| 0.179047086 | 1 | |
| 0.1716821955 | 1 | |
| 0.1490337043 | 1 | |
| 0.1458244548 | 1 |
1st_2nd_use_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01530081815 |
| Minimum | 0.0002758478593 |
|---|---|
| Maximum | 0.3867030993 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0002758478593 |
|---|---|
| 5-th percentile | 0.000782763203 |
| Q1 | 0.001691636004 |
| median | 0.003519769878 |
| Q3 | 0.009301449275 |
| 95-th percentile | 0.04838330805 |
| Maximum | 0.3867030993 |
| Range | 0.3864272515 |
| Interquartile range (IQR) | 0.00760981327 |
Descriptive statistics
| Standard deviation | 0.04763135418 |
|---|---|
| Coefficient of variation (CV) | 3.11299394 |
| Kurtosis | 40.10922311 |
| Mean | 0.01530081815 |
| Median Absolute Deviation (MAD) | 0.002457200481 |
| Skewness | 6.089131799 |
| Sum | 1.989106359 |
| Variance | 0.002268745901 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.006890411732 | 1 | 0.5% |
| 0.001416576933 | 1 | 0.5% |
| 0.01642958016 | 1 | 0.5% |
| 0.0003110488609 | 1 | 0.5% |
| 0.001454163725 | 1 | 0.5% |
| 0.00238595171 | 1 | 0.5% |
| 0.00219730284 | 1 | 0.5% |
| 0.00219281746 | 1 | 0.5% |
| 0.008890991733 | 1 | 0.5% |
| 0.02054012662 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0002758478593 | 1 | |
| 0.0003110488609 | 1 | |
| 0.000346880376 | 1 | |
| 0.0007009087638 | 1 | |
| 0.0007175441693 | 1 | |
| 0.0007365845831 | 1 | |
| 0.000779256 | 1 | |
| 0.0007870497845 | 1 | |
| 0.0007882253443 | 1 | |
| 0.00086396236 | 1 |
| Value | Count | Frequency (%) |
| 0.3867030993 | 1 | |
| 0.3005765842 | 1 | |
| 0.2158303006 | 1 | |
| 0.1053613701 | 1 | |
| 0.08750394489 | 1 | |
| 0.0553042328 | 1 | |
| 0.05238983032 | 1 | |
| 0.04348644751 | 1 | |
| 0.03195685445 | 1 | |
| 0.02834948051 | 1 |
1st_2nd_stm_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05353556076 |
| Minimum | 0.001682788181 |
|---|---|
| Maximum | 0.7744447283 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.001682788181 |
|---|---|
| 5-th percentile | 0.005117488186 |
| Q1 | 0.01071727296 |
| median | 0.02048012628 |
| Q3 | 0.04495070752 |
| 95-th percentile | 0.1943520418 |
| Maximum | 0.7744447283 |
| Range | 0.7727619401 |
| Interquartile range (IQR) | 0.03423343456 |
Descriptive statistics
| Standard deviation | 0.1151493406 |
|---|---|
| Coefficient of variation (CV) | 2.150894452 |
| Kurtosis | 26.22181773 |
| Mean | 0.05353556076 |
| Median Absolute Deviation (MAD) | 0.01162726414 |
| Skewness | 4.89588739 |
| Sum | 6.959622899 |
| Variance | 0.01325937065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.02260118292 | 1 | 0.5% |
| 0.009449573254 | 1 | 0.5% |
| 0.03963534533 | 1 | 0.5% |
| 0.007421928113 | 1 | 0.5% |
| 0.003746119751 | 1 | 0.5% |
| 0.09171021114 | 1 | 0.5% |
| 0.005944617741 | 1 | 0.5% |
| 0.03951817653 | 1 | 0.5% |
| 0.01602214829 | 1 | 0.5% |
| 0.05090582931 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.001682788181 | 1 | |
| 0.002165333208 | 1 | |
| 0.002688331847 | 1 | |
| 0.00276490608 | 1 | |
| 0.003657165131 | 1 | |
| 0.003746119751 | 1 | |
| 0.004710118217 | 1 | |
| 0.005615384814 | 1 | |
| 0.005940870585 | 1 | |
| 0.005944617741 | 1 |
| Value | Count | Frequency (%) |
| 0.7744447283 | 1 | |
| 0.7636754462 | 1 | |
| 0.6141453375 | 1 | |
| 0.3477130875 | 1 | |
| 0.2435585183 | 1 | |
| 0.2179150846 | 1 | |
| 0.2083755094 | 1 | |
| 0.1772122481 | 1 | |
| 0.1755816702 | 1 | |
| 0.1558968377 | 1 |
1st_curr_tfidf_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.007589426288 |
| Minimum | 0.0003896447887 |
|---|---|
| Maximum | 0.02386050154 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0003896447887 |
|---|---|
| 5-th percentile | 0.001023223277 |
| Q1 | 0.003846985604 |
| median | 0.00646586775 |
| Q3 | 0.009864114097 |
| 95-th percentile | 0.01796670161 |
| Maximum | 0.02386050154 |
| Range | 0.02347085675 |
| Interquartile range (IQR) | 0.006017128493 |
Descriptive statistics
| Standard deviation | 0.005316036545 |
|---|---|
| Coefficient of variation (CV) | 0.7004530176 |
| Kurtosis | 0.6404123376 |
| Mean | 0.007589426288 |
| Median Absolute Deviation (MAD) | 0.003069252825 |
| Skewness | 1.019513985 |
| Sum | 0.9866254175 |
| Variance | 2.826024455 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.003065826263 | 1 | 0.5% |
| 0.004234045763 | 1 | 0.5% |
| 0.004329730044 | 1 | 0.5% |
| 0.004534587439 | 1 | 0.5% |
| 0.01192001661 | 1 | 0.5% |
| 0.003862966129 | 1 | 0.5% |
| 0.006434186486 | 1 | 0.5% |
| 0.002190645455 | 1 | 0.5% |
| 0.004409882225 | 1 | 0.5% |
| 0.01311072643 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0003896447887 | 1 | |
| 0.0007049835 | 1 | |
| 0.0007232732558 | 1 | |
| 0.0007530491216 | 1 | |
| 0.0008405541667 | 1 | |
| 0.0009431983333 | 1 | |
| 0.0009764686207 | 1 | |
| 0.001080367857 | 1 | |
| 0.001293729545 | 1 | |
| 0.001497160179 | 1 |
| Value | Count | Frequency (%) |
| 0.02386050154 | 1 | |
| 0.02272727273 | 1 | |
| 0.02267330958 | 1 | |
| 0.02196901183 | 1 | |
| 0.02001269931 | 1 | |
| 0.01912179 | 1 | |
| 0.01804372918 | 1 | |
| 0.01787255681 | 1 | |
| 0.01756787192 | 1 | |
| 0.01600473256 | 1 |
1st_curr_lda_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.007443657467 |
| Minimum | 0.0002095214932 |
|---|---|
| Maximum | 0.02457363487 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0002095214932 |
|---|---|
| 5-th percentile | 0.002548282778 |
| Q1 | 0.003920876611 |
| median | 0.006895408428 |
| Q3 | 0.01036407592 |
| 95-th percentile | 0.0136626849 |
| Maximum | 0.02457363487 |
| Range | 0.02436411338 |
| Interquartile range (IQR) | 0.006443199308 |
Descriptive statistics
| Standard deviation | 0.004101948869 |
|---|---|
| Coefficient of variation (CV) | 0.5510663121 |
| Kurtosis | 1.502641198 |
| Mean | 0.007443657467 |
| Median Absolute Deviation (MAD) | 0.003108997307 |
| Skewness | 0.9198687379 |
| Sum | 0.9676754708 |
| Variance | 1.682598452 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.006405540584 | 1 | 0.5% |
| 0.00458027059 | 1 | 0.5% |
| 0.003232648023 | 1 | 0.5% |
| 0.004097849419 | 1 | 0.5% |
| 0.01198650291 | 1 | 0.5% |
| 0.006722311877 | 1 | 0.5% |
| 0.003049124563 | 1 | 0.5% |
| 0.00072330693 | 1 | 0.5% |
| 0.003977092082 | 1 | 0.5% |
| 0.0106961995 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0002095214932 | 1 | |
| 0.00072330693 | 1 | |
| 0.001114218953 | 1 | |
| 0.001776141399 | 1 | |
| 0.002319989807 | 1 | |
| 0.002463705894 | 1 | |
| 0.00254167689 | 1 | |
| 0.002556356642 | 1 | |
| 0.002581508016 | 1 | |
| 0.002581733867 | 1 |
| Value | Count | Frequency (%) |
| 0.02457363487 | 1 | |
| 0.0189185454 | 1 | |
| 0.01842834998 | 1 | |
| 0.01616458641 | 1 | |
| 0.01446806131 | 1 | |
| 0.01399008142 | 1 | |
| 0.01374956757 | 1 | |
| 0.01355649497 | 1 | |
| 0.01326802972 | 1 | |
| 0.01287594589 | 1 |
1st_curr_use_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.001763425116 |
| Minimum | 8.434639675 × 10-5 |
|---|---|
| Maximum | 0.008277623793 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 8.434639675 × 10-5 |
|---|---|
| 5-th percentile | 0.0002272789074 |
| Q1 | 0.0008684663079 |
| median | 0.001311715606 |
| Q3 | 0.002135890599 |
| 95-th percentile | 0.004904710261 |
| Maximum | 0.008277623793 |
| Range | 0.008193277397 |
| Interquartile range (IQR) | 0.001267424291 |
Descriptive statistics
| Standard deviation | 0.001500057454 |
|---|---|
| Coefficient of variation (CV) | 0.8506499318 |
| Kurtosis | 4.066056521 |
| Mean | 0.001763425116 |
| Median Absolute Deviation (MAD) | 0.0006193336455 |
| Skewness | 1.927897491 |
| Sum | 0.229245265 |
| Variance | 2.250172366 × 10-6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.001686505496 | 1 | 0.5% |
| 0.0007362184559 | 1 | 0.5% |
| 0.001447540102 | 1 | 0.5% |
| 0.0005295494624 | 1 | 0.5% |
| 0.002439334485 | 1 | 0.5% |
| 0.001155890264 | 1 | 0.5% |
| 0.001233448255 | 1 | 0.5% |
| 0.0001358944032 | 1 | 0.5% |
| 0.002406681829 | 1 | 0.5% |
| 0.00143836549 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 8.434639675 × 10-5 | 1 | |
| 0.0001242641561 | 1 | |
| 0.0001358944032 | 1 | |
| 0.0001646019035 | 1 | |
| 0.0001818264 | 1 | |
| 0.0002164432958 | 1 | |
| 0.0002228054348 | 1 | |
| 0.000232746485 | 1 | |
| 0.0004024691666 | 1 | |
| 0.0004232424944 | 1 |
| Value | Count | Frequency (%) |
| 0.008277623793 | 1 | |
| 0.007046005358 | 1 | |
| 0.00649811518 | 1 | |
| 0.006098761269 | 1 | |
| 0.005524154085 | 1 | |
| 0.005285793785 | 1 | |
| 0.004930787287 | 1 | |
| 0.004872838339 | 1 | |
| 0.004870150068 | 1 | |
| 0.004724827562 | 1 |
1st_curr_stm_adj
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 130 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 63 |
| Missing (%) | 32.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.006985924078 |
| Minimum | 0.0006883178409 |
|---|---|
| Maximum | 0.01888800252 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0.0006883178409 |
|---|---|
| 5-th percentile | 0.002305444263 |
| Q1 | 0.003935651623 |
| median | 0.006695388051 |
| Q3 | 0.008977120081 |
| 95-th percentile | 0.01345980475 |
| Maximum | 0.01888800252 |
| Range | 0.01819968468 |
| Interquartile range (IQR) | 0.005041468458 |
Descriptive statistics
| Standard deviation | 0.003829646204 |
|---|---|
| Coefficient of variation (CV) | 0.5481946499 |
| Kurtosis | 0.5827017658 |
| Mean | 0.006985924078 |
| Median Absolute Deviation (MAD) | 0.002743480786 |
| Skewness | 0.9001967536 |
| Sum | 0.9081701301 |
| Variance | 1.466619005 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.003550080413 | 1 | 0.5% |
| 0.005592480762 | 1 | 0.5% |
| 0.00349210091 | 1 | 0.5% |
| 0.004831091282 | 1 | 0.5% |
| 0.01340751104 | 1 | 0.5% |
| 0.008756489701 | 1 | 0.5% |
| 0.002902533256 | 1 | 0.5% |
| 0.002998388693 | 1 | 0.5% |
| 0.003822790585 | 1 | 0.5% |
| 0.01107296719 | 1 | 0.5% |
| Other values (120) | 120 | |
| (Missing) | 63 |
| Value | Count | Frequency (%) |
| 0.0006883178409 | 1 | |
| 0.0009845635344 | 1 | |
| 0.001549924528 | 1 | |
| 0.001555232888 | 1 | |
| 0.001905146018 | 1 | |
| 0.002196125401 | 1 | |
| 0.002273599935 | 1 | |
| 0.002344365109 | 1 | |
| 0.002626796698 | 1 | |
| 0.002833015242 | 1 |
| Value | Count | Frequency (%) |
| 0.01888800252 | 1 | |
| 0.01886959884 | 1 | |
| 0.01811448267 | 1 | |
| 0.01590918734 | 1 | |
| 0.01572122068 | 1 | |
| 0.01352755029 | 1 | |
| 0.0135025905 | 1 | |
| 0.01340751104 | 1 | |
| 0.01307142027 | 1 | |
| 0.01291238536 | 1 |
tfidf_distance
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.064712286 |
| Minimum | 0 |
|---|---|
| Maximum | 8.805528793 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.6192621589 |
| Q3 | 1.457402639 |
| 95-th percentile | 4.336997896 |
| Maximum | 8.805528793 |
| Range | 8.805528793 |
| Interquartile range (IQR) | 1.457402639 |
Descriptive statistics
| Standard deviation | 1.432658755 |
|---|---|
| Coefficient of variation (CV) | 1.345583003 |
| Kurtosis | 5.270935841 |
| Mean | 1.064712286 |
| Median Absolute Deviation (MAD) | 0.6192621589 |
| Skewness | 2.054736816 |
| Sum | 205.4894711 |
| Variance | 2.052511108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 2.55466716 | 1 | 0.5% |
| 1.346237004 | 1 | 0.5% |
| 0.9828487206 | 1 | 0.5% |
| 3.680461466 | 1 | 0.5% |
| 2.866323628 | 1 | 0.5% |
| 0.3675777912 | 1 | 0.5% |
| 1.50062342 | 1 | 0.5% |
| 0.3533120155 | 1 | 0.5% |
| 2.627345592 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.02017331123 | 1 | 0.5% |
| 0.03110074997 | 1 | 0.5% |
| 0.05659192801 | 1 | 0.5% |
| 0.05692410469 | 1 | 0.5% |
| 0.08299434185 | 1 | 0.5% |
| 0.08736741543 | 1 | 0.5% |
| 0.1200658083 | 1 | 0.5% |
| 0.1280924678 | 1 | 0.5% |
| 0.1383602619 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 8.805528793 | 1 | |
| 5.926186346 | 1 | |
| 5.665635705 | 1 | |
| 5.363589868 | 1 | |
| 4.714640707 | 1 | |
| 4.614361033 | 1 | |
| 4.563309059 | 1 | |
| 4.48809563 | 1 | |
| 4.398849934 | 1 | |
| 4.370397478 | 1 |
lda_distance
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8243181345 |
| Minimum | 0 |
|---|---|
| Maximum | 5.310201276 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.5798510483 |
| Q3 | 1.150007772 |
| 95-th percentile | 2.687808073 |
| Maximum | 5.310201276 |
| Range | 5.310201276 |
| Interquartile range (IQR) | 1.150007772 |
Descriptive statistics
| Standard deviation | 0.9664345238 |
|---|---|
| Coefficient of variation (CV) | 1.172404783 |
| Kurtosis | 3.256722106 |
| Mean | 0.8243181345 |
| Median Absolute Deviation (MAD) | 0.5798510483 |
| Skewness | 1.675308658 |
| Sum | 159.0934 |
| Variance | 0.9339956887 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 2.182414969 | 1 | 0.5% |
| 1.135737341 | 1 | 0.5% |
| 0.7338111011 | 1 | 0.5% |
| 2.598592762 | 1 | 0.5% |
| 1.352163196 | 1 | 0.5% |
| 0.5455480628 | 1 | 0.5% |
| 0.3011339598 | 1 | 0.5% |
| 0.2367953335 | 1 | 0.5% |
| 2.341969053 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.009009424209 | 1 | 0.5% |
| 0.106568484 | 1 | 0.5% |
| 0.1136503332 | 1 | 0.5% |
| 0.23170049 | 1 | 0.5% |
| 0.2367953335 | 1 | 0.5% |
| 0.2703033666 | 1 | 0.5% |
| 0.2703666181 | 1 | 0.5% |
| 0.2783459033 | 1 | 0.5% |
| 0.2847315518 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 5.310201276 | 1 | |
| 4.299676602 | 1 | |
| 3.981688847 | 1 | |
| 3.608418739 | 1 | |
| 3.518476125 | 1 | |
| 3.422846227 | 1 | |
| 3.152002994 | 1 | |
| 3.026661364 | 1 | |
| 3.014511213 | 1 | |
| 2.813127573 | 1 |
use_distance
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.177404477 |
| Minimum | 0 |
|---|---|
| Maximum | 1.839643225 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.08428697711 |
| Q3 | 0.242137435 |
| 95-th percentile | 0.7334038791 |
| Maximum | 1.839643225 |
| Range | 1.839643225 |
| Interquartile range (IQR) | 0.242137435 |
Descriptive statistics
| Standard deviation | 0.2513756453 |
|---|---|
| Coefficient of variation (CV) | 1.416963368 |
| Kurtosis | 10.67329134 |
| Mean | 0.177404477 |
| Median Absolute Deviation (MAD) | 0.08428697711 |
| Skewness | 2.667428981 |
| Sum | 34.23906407 |
| Variance | 0.06318971504 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.2721637196 | 1 | 0.5% |
| 0.09802493714 | 1 | 0.5% |
| 0.3285916033 | 1 | 0.5% |
| 0.2856437567 | 1 | 0.5% |
| 0.122433367 | 1 | 0.5% |
| 0.07973911799 | 1 | 0.5% |
| 0.08428697711 | 1 | 0.5% |
| 0.02295066144 | 1 | 0.5% |
| 0.5079892852 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.005343358713 | 1 | 0.5% |
| 0.010909584 | 1 | 0.5% |
| 0.02295066144 | 1 | 0.5% |
| 0.02483414906 | 1 | 0.5% |
| 0.02570967874 | 1 | 0.5% |
| 0.03210179473 | 1 | 0.5% |
| 0.03364362066 | 1 | 0.5% |
| 0.03753879925 | 1 | 0.5% |
| 0.03899271319 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 1.839643225 | 1 | |
| 0.9981496239 | 1 | |
| 0.9761781719 | 1 | |
| 0.9259964936 | 1 | |
| 0.8976717033 | 1 | |
| 0.8100906049 | 1 | |
| 0.8050738149 | 1 | |
| 0.7783798098 | 1 | |
| 0.7424658286 | 1 | |
| 0.7404241889 | 1 |
stm_distance
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 131 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.131337558 |
| Minimum | 0 |
|---|---|
| Maximum | 13.761275 |
| Zeros | 63 |
| Zeros (%) | 32.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.5333420282 |
| Q3 | 1.453148717 |
| 95-th percentile | 4.325649139 |
| Maximum | 13.761275 |
| Range | 13.761275 |
| Interquartile range (IQR) | 1.453148717 |
Descriptive statistics
| Standard deviation | 1.739749441 |
|---|---|
| Coefficient of variation (CV) | 1.537781035 |
| Kurtosis | 15.77721491 |
| Mean | 1.131337558 |
| Median Absolute Deviation (MAD) | 0.5333420282 |
| Skewness | 3.237533468 |
| Sum | 218.3481487 |
| Variance | 3.026728118 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 1.852420044 | 1 | 0.5% |
| 1.559017085 | 1 | 0.5% |
| 0.7927069066 | 1 | 0.5% |
| 4.211583706 | 1 | 0.5% |
| 2.577173316 | 1 | 0.5% |
| 1.420003692 | 1 | 0.5% |
| 0.3574582867 | 1 | 0.5% |
| 0.4350027129 | 1 | 0.5% |
| 3.361402 | 1 | 0.5% |
| Other values (121) | 121 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.06664675472 | 1 | 0.5% |
| 0.103935994 | 1 | 0.5% |
| 0.2493890969 | 1 | 0.5% |
| 0.2554971727 | 1 | 0.5% |
| 0.2564407247 | 1 | 0.5% |
| 0.2853324978 | 1 | 0.5% |
| 0.2884652441 | 1 | 0.5% |
| 0.3030384246 | 1 | 0.5% |
| 0.324062172 | 1 | 0.5% |
| Value | Count | Frequency (%) |
| 13.761275 | 1 | |
| 7.655179479 | 1 | |
| 7.011777933 | 1 | |
| 6.548253206 | 1 | |
| 6.469361033 | 1 | |
| 5.623032088 | 1 | |
| 5.45806128 | 1 | |
| 5.290515012 | 1 | |
| 4.525869525 | 1 | |
| 4.496747288 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| code | country | 1st_const_year | 2nd_const_year | 1st_2nd_tfidf | 1st_current_tfidf | 1st_2nd_lda | 1st_current_lda | 1st_2nd_use | 1st_current_use | 1st_2nd_stm | 1st_current_stm | constitutional_time | first_regime_time | 1st_2nd_tfidf_adj | 1st_2nd_lda_adj | 1st_2nd_use_adj | 1st_2nd_stm_adj | 1st_curr_tfidf_adj | 1st_curr_lda_adj | 1st_curr_use_adj | 1st_curr_stm_adj | tfidf_distance | lda_distance | use_distance | stm_distance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | AFG | Afghanistan | 1923 | 1931.0 | 0.353630 | 0.303517 | 0.490204 | 0.634149 | 0.055123 | 0.166964 | 0.180809 | 0.351458 | 99 | 8.0 | 0.044204 | 0.061275 | 0.006890 | 0.022601 | 0.003066 | 0.006406 | 0.001687 | 0.003550 | 2.554667 | 2.182415 | 0.272164 | 1.852420 |
| 1 | ALB | Albania | 1925 | 1928.0 | 0.513447 | 0.636341 | 0.437473 | 0.788622 | 0.082878 | 0.102103 | 0.526745 | 0.751644 | 97 | 3.0 | 0.171149 | 0.145824 | 0.027626 | 0.175582 | 0.006560 | 0.008130 | 0.001053 | 0.007749 | 2.818089 | 2.412450 | 0.778380 | 3.296292 |
| 2 | DZA | Algeria | 1963 | 1996.0 | 0.650947 | 0.650947 | 0.536840 | 0.536840 | 0.132969 | 0.132969 | 0.483908 | 0.483908 | 59 | 33.0 | 0.019726 | 0.016268 | 0.004029 | 0.014664 | 0.011033 | 0.009099 | 0.002254 | 0.008202 | 0.650947 | 0.536840 | 0.132969 | 0.483908 |
| 3 | AND | Andorra | 1993 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 29 | 29.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 4 | AGO | Angola | 1975 | 2010.0 | 0.317443 | 0.317443 | 0.571623 | 0.571623 | 0.305411 | 0.305411 | 0.489318 | 0.489318 | 47 | 35.0 | 0.009070 | 0.016332 | 0.008726 | 0.013981 | 0.006754 | 0.012162 | 0.006498 | 0.010411 | 0.317443 | 0.571623 | 0.305411 | 0.489318 |
| 5 | ATG | Antigua Barbuda | 1981 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 41 | 41.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 6 | ARG | Argentina | 1819 | 1826.0 | 0.505489 | 0.794305 | 0.251727 | 0.733257 | 0.014585 | 0.043938 | 0.252834 | 0.461541 | 203 | 7.0 | 0.072213 | 0.035961 | 0.002084 | 0.036119 | 0.003913 | 0.003612 | 0.000216 | 0.002274 | 1.149967 | 0.881207 | 0.037539 | 0.714374 |
| 7 | ARM | Armenia | 1995 | 2005.0 | 0.064511 | 0.371961 | 0.122443 | 0.377732 | 0.034281 | 0.096356 | 0.180414 | 0.352928 | 27 | 10.0 | 0.006451 | 0.012244 | 0.003428 | 0.018041 | 0.013776 | 0.013990 | 0.003569 | 0.013071 | 0.389074 | 0.419085 | 0.084017 | 0.533342 |
| 8 | AUS | Australia | 1901 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 121 | 121.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 9 | AUT | Austria | 1920 | 1934.0 | 0.338217 | 0.338217 | 0.113650 | 0.113650 | 0.048903 | 0.048903 | 0.403290 | 0.403290 | 102 | 14.0 | 0.024158 | 0.008118 | 0.003493 | 0.028806 | 0.003316 | 0.001114 | 0.000479 | 0.003954 | 0.338217 | 0.113650 | 0.048903 | 0.403290 |
Last rows
| code | country | 1st_const_year | 2nd_const_year | 1st_2nd_tfidf | 1st_current_tfidf | 1st_2nd_lda | 1st_current_lda | 1st_2nd_use | 1st_current_use | 1st_2nd_stm | 1st_current_stm | constitutional_time | first_regime_time | 1st_2nd_tfidf_adj | 1st_2nd_lda_adj | 1st_2nd_use_adj | 1st_2nd_stm_adj | 1st_curr_tfidf_adj | 1st_curr_lda_adj | 1st_curr_use_adj | 1st_curr_stm_adj | tfidf_distance | lda_distance | use_distance | stm_distance | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 183 | USA | United states | 1789 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 233 | 233.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 184 | URY | Uruguay | 1830 | 1918.0 | 0.070556 | 0.852828 | 0.230569 | 0.661282 | 0.024275 | 0.201772 | 0.236573 | 0.582161 | 192 | 88.0 | 0.000802 | 0.002620 | 0.000276 | 0.002688 | 0.004442 | 0.003444 | 0.001051 | 0.003032 | 1.279092 | 0.927896 | 0.171229 | 1.410233 |
| 185 | UZB | Uzbekistan | 1992 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 30 | 30.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 186 | VUT | Vanuatu | 1979 | 1980.0 | 0.031101 | 0.031101 | 0.009009 | 0.009009 | 0.005343 | 0.005343 | 0.066647 | 0.066647 | 43 | 1.0 | 0.031101 | 0.009009 | 0.005343 | 0.066647 | 0.000723 | 0.000210 | 0.000124 | 0.001550 | 0.031101 | 0.009009 | 0.005343 | 0.066647 |
| 187 | VEN | Venezuela | 1830 | 1858.0 | 0.495311 | 0.690389 | 0.181846 | 0.760860 | 0.047112 | 0.185052 | 0.184205 | 0.780078 | 192 | 28.0 | 0.017690 | 0.006495 | 0.001683 | 0.006579 | 0.003596 | 0.003963 | 0.000964 | 0.004063 | 5.665636 | 3.608419 | 0.503704 | 13.761275 |
| 188 | VDR | Vietnam | 1960 | 1980.0 | 0.885566 | 0.891462 | 0.240032 | 0.263229 | 0.052652 | 0.056959 | 0.253492 | 0.228797 | 62 | 20.0 | 0.044278 | 0.012002 | 0.002633 | 0.012675 | 0.014378 | 0.004246 | 0.000919 | 0.003690 | 1.157357 | 0.482546 | 0.071202 | 0.482289 |
| 189 | YEM | Yemen Arab Republic | 1970 | 1991.0 | 0.913529 | 0.913529 | 0.647501 | 0.647501 | 0.065579 | 0.065579 | 0.663185 | 0.663185 | 52 | 21.0 | 0.043501 | 0.030833 | 0.003123 | 0.031580 | 0.017568 | 0.012452 | 0.001261 | 0.012754 | 0.913529 | 0.647501 | 0.065579 | 0.663185 |
| 190 | YUG | Yugoslavia | 1921 | 1931.0 | 0.736563 | 0.929297 | 0.484082 | 0.731703 | 0.035465 | 0.133398 | 0.453762 | 0.687188 | 101 | 10.0 | 0.073656 | 0.048408 | 0.003546 | 0.045376 | 0.009201 | 0.007245 | 0.001321 | 0.006804 | 4.488096 | 3.981689 | 0.742466 | 5.623032 |
| 191 | ZMB | Zambia | 1964 | 1973.0 | 0.475826 | 0.086892 | 0.110700 | 0.148269 | 0.007083 | 0.009547 | 0.053468 | 0.202029 | 58 | 9.0 | 0.052870 | 0.012300 | 0.000787 | 0.005941 | 0.001498 | 0.002556 | 0.000165 | 0.003483 | 0.963551 | 0.231700 | 0.024834 | 0.255497 |
| 192 | ZWE | Zimbabwe | 1965 | 1969.0 | 0.249281 | 0.912270 | 0.370943 | 0.824679 | 0.032659 | 0.092031 | 0.329574 | 0.688878 | 57 | 4.0 | 0.062320 | 0.092736 | 0.008165 | 0.082393 | 0.016005 | 0.014468 | 0.001615 | 0.012086 | 1.118774 | 1.431619 | 0.132359 | 1.478264 |